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ABSTRACT 

This is ci review of studies designed to assess the 
attributes of teacher-student interaction predicted to be correlated 
vith student achievement. The paper includes: (1) a summary reviev of 
the types of ±eacher characteristics .*:Qund to be correlated with 
student achievement and attitude; (2) . a summary of the research done 
using the Flanders Interaction Analysis Index; and (3) a final 
' summary and conclusions section followed ty an extensive list of 
references. The review of available -research makes it clear that good 
experimental research in^the area of student-teacher interaction has 
^been relatively sparse^, !fhexe are a number of problems with existing 
'studies: primarily, nearly, all studies to date have been 
correlational in nature. Many have used scales designed for uscj with 
all grade levels with no allowances made. Consequently, findings have 
been inconsistent across grade levels. A third problem is that many 
studies and systems of observations do not include reliability data 
nor is the rating scale specific to the subject matter. Probably the 
most important shortcoming of ^research in this area is that few * 
scales take into account which studen'ts are interacting with the 
teacher and roost use a total class' mean as opposed. to individual 
evaluations in determining academic achievement. Two final p?:oblems 
are that studies use a relatively low number of teachers and an 
inadequate? number of observations ol each teacher to obtain 
significant data. (DBTT 
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J. 

1^ The original purpose of this review was to prepare for a study of classroom 

interaction. This study was to be designed to look at the relationship between 

'a 

1 1 i the information level of the teacher's verbal output and an estimate of the chile's 
itility to comprehend. On the teacher's side, analysis of verbal output was to be 
looked at in terms of , information contained in that output. On the student's 
side, a^ialysis of information available to the child would be ascertained by tests 
over :naterial to be covered by the teacher. It was then hoped that a way could 
be designed in which it would be possible to acquire an index of how much of the 
school day was y?elevant for specific students. Relevancy was to be defined in 
terns of whether or not the information which was being presented was new to the 
child and still comprehensible so that new learning would take place. That is, 
information that was not above the child's competency to understand or not redun- 
dant was to be classified as relevant. The idea of matching the level of child 
with level of information being acquired is of course not original and has been 
discussed elsewhere. In fact, that type of matching is what the open classroom is 
developed to provide. The philosophy of the present author is, however, different 
from thc^t of the open classroom advocates in two ways. First, it would give more 
emphasis to the role of the teacher both as a no^vator and also as a structurer 
of the environment. The teacher's role would be more to make major decisions 

^^ and then Justify these decisions to the child rather than emphasize the child's 

role in the decision making process. Second, more time would be spent in group 
work when there was evidence that v/orking with individuals would be a waste of 

^ » time. - 
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The lor.' terrf: ^-oilo ot this line of research would include (1) being able 
to '".ake pr--^ iictions concvmin*: which students in a classroom would progress most, 
basel upon thx? t^attern and level of information provided by the teacher, (2) 
ictually in titutint'^ experin^ntdl treatments in classi'ooms by specifying to teachers 
the tvpe of infor^.ation which should be provided to specific children. This 
woil! ir.cluJo iookinr at loth the qualitativre level of the child's functioning and* 
Quantitative ainount of informatipn available in deciding what type of input should 
I- vr'>v!^^i for naxirr,um learning. 

(lit: the a:ove objective in mind, the studies to be reviewed here are those 
whiirh w^re iMcIrnel to assess axtributes of teacher-student interaction which werq 
:reiictei to le correlated with some ^ort of student gain. That is, those studies 
which creii:tei that student -teacher interaction measures would be correlated with 
3t"iient aci.ievenier.t or student attitude. 

Tor those v;ho are interested, there are ,a number of mpre inclusive summaries 
t'3icher-siudent interaction studies already in existence (Withall and Lewis, 1963; 
M^dlev and Mitzel, 1963; Rosenshine, 1970; Maux, 1967; Aschner, 1963; Amidon and 
::iTr.ori, Levin, Hiltor and Leiderman, 1957; Rosenshine and Furst, 1971). For 

*:ho3e interested in looking at student-teacher rating instruments the most inclusive 
Calthoujjh not all inclusive) list along with detailed descriptions is included in 
a pablication by Simon and Boyer (l970). In addition. Research for Better Schools, 
Inc., publishes a monthly newsletter which includes up to date unpublished and 
published research in this area. 

The plan for the remainder of this paper includes (1) an initial summary review 
th*^ tyrec of teacher characteristics which have been found to be correlated with 

m 

rt'ident achievenent or attitude (2) a summary of the research done using the 
rian-ier^ Interaction Analysis Index (this is done separately since the Flanders 
3cale i::; the nost used index in this area) and finally (3) a summary and conclu- 



Sunir.ary of Research Findinp^s to Date 

The most recent extensive review of the interaction research using student 
perforrnance as a correlate of teacher behavior was the paper by Rosenshine and 
Furst (1971). These authcrr list 11 teacher variables which have been correlated 
t> varying degrees with student performance. Because this is a recent and extremely 
corprehensive review, the present author has used the categories of behavior used 
by Rosenshine and Furst which involve observed student-teacher interaction in 
iiL^cussirr the findings to date. The present author has also included the refer- 
ences u^ei by Rosenshine and Furst along with new references where appropriate. 

Before beginning: it is important to note that one of the more important dimen- 
sions along which these scales can be divided is the degree of inference required 
by the rater. That is, if the scale is merely a frequency count of behaviors then 
little inference on the part of the rater is required. On the other hand, if the 
r,cale asks the rater to measure the degree to which some ting exists in the class- 
room (e.g. teacher clarity) a gr^at deal of inference is required on the part of 
the rater. It is obviously true j that the more opportunity for inference the 
gteatev the opportunity there is^ for low inter-rater reliability and thus the less 
confidence one can have in the fjindings and the less chance there is that repli- 
cation can be acquired using different raters. It is important to keep this in 
mind when the results of studies are considered. 

The category of teacher behavior showing the highest relationship with student 
gain is clarity of presentation (Belgard, Rosenshine , and Gage, 1968; Fortune, 1967; 
Fortune, "a^e and Shutes, 1966; Soloman, Bezdek and Rosenberg, 1963; Wallen, 1966; 
'"hall and- Feldman, 1966). It is obvious that this is a high inference behavior 
on the part of the rater. What may be clear to the rdter may not be clear to the 
student. In fact, what may be clear to one student may not be clear to another. 
Rosenshine and Furst believe that future research should be directed toward more 
carefully defining behaviors which comprise clarity. The pr^nent author thinks 
it ir. interesting, that these behaviors would probably require careful lesson 
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plans and detailed lectures or presentations whichare somewhat against the current 
emphasis on discovery learning and open classrooms (one has to be careful here 
because correct use of the open classroom philosophy requires that there be a 
detailed plan for each child). 

A second category of benaviors associated with student gain hds been labeled 
variablity or variety. This refers to using many different teaching devices, 
types of testr, and lecture techniques (Anthony, 1967; Lea, 1967; Furst, 1967; 
Thompson anl Bowers, 1068; Soloman, Bezdek and Rosenberg, 1963; Torrance and Parent, 
1966; Walberg, 1969). This includes stur^iec which used both high and low inference 
rating scales. Unfortunately, when low inference definitions have b.een used (e.g. 
nuirier of specific types of behavior which were exhibited) no significant correla- 
tions letween the nuir.ber of teacher hehaviors and the student gain have been fc nd 
(Soar, 1966; Snider, 1966; Vorreyer, 1965). 

A particularly high inference behavior is that of teacher enthusiasm. Here 
again we find significant correlations with student performance but no specific 
behaviors (Fortune, 1967; Kleinman, 196U; Wallen, 1966; Soloman, Bezdek and 
Posenber^, 1963). 

The results of studies measuring a task oriented or business like behavior 
have also found relationships between such behaviors and student performance 
(Fortune, 1967; Kleinman, 196U; Chall and Feldman, 1966; Uallen, 1966). If teachers 
seem content oriented and less apt to emphasize having the children enjoy them- 
selves there Is a better chance that the children will learn. 

Criticism is a behavior which has been found to correlate negatively with 
student gain in a number of studies (Flanders, 1970; Hunter, 1968; Harris and 
Serweryl966; Anthony, 1967; Cook, 1967; Hunter, 1968; Harris etal, 1968; 
Soar, 1966; Wallen, 1966). It is interesting to note that it has been found 
that significant negative correlations only occur when severe criticism was used. 
Rosenshine says **In no study was there a significant negative, correlation between 
mild forms of criticism or control and student achievement*' (p. 51). These mild 
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for^.s include T<ainiy Information types of comments such as sayirig to the child that 
he i-^ incorrect or providing new academic directions. 

Several researchers have attempted to classify the types of questions which 
teachers ask ani then lookei at subsequent student achievement to determine 
whether there_is a relationship between the two variables. In general, the questions 
are rated on level of answer required by the teacher (e.g. straight recall or 
reviirine reasoning). Jhe positive studies find that teachers asking higher level . 
• jU^^t*: ionr. 'tl3o have higher achieving students (Kleinman, 1964; Spaulding, 1965H 
There ^re however, a nunber of such studies in which significant results were not 
obtainei ([{arris and ferwer, 1966; Harris et al , 1968, Perkins, 1965; Wright and 
:*uthall, i:^7}) '< 
The Flanders System of Interaction Analysis 

The TiOSt used instrument in recent years for recording student-teacher inter- 
action ha^> been the Flanders System of Interaction Analysis (FSIA). Actually the 
na'^e is sorewhat r.isleading since the emphasis is realw.y on the teacher's behavior. 
^tu:ient behavior is only recorded when the teacher is not engaged in any activity 
other than ot:>ervinfc!: or listening. The instrument includes seven categories of 
teacher behavior and 2 categories of student talk. The teacher behaviors (assumed 
to bo all inclusive of teacher behaviors) are divided into two broad categories 
of teacher influence labeled indirect and direct. The indirect influence cate- 
gories incluie (1) acc^^-pts feeling (2) praises and encourages (3) accepts or 
uses ideas of student (M-) asks questions. The direct influence categories include 
(5) lecturing (6) giving directions and (7) criticizing or justifying authority. 
The two student talk categories are (8) student talk-response and (9) student talk- 
initiation. Finally, there is a category 10 labeled silence or confusion. 

^ince this is a low inference scale in which behaviors are tabulated without 
requirit^g the observer to rrake any judgments, the degree of interrater reliability 
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becones quite hlrh <ifter a short training period. Although the instrument has 
been r.ost used in training teachers, there have been a number of studies which 
attempt to looV at behavior patterns of teachers with the goal of determining 
T.ore effective teacher behavior patterjis. The operational definitions of effective 
Include both student attitude and student achievement. 

Flanders himself reports seven stucjies which he conducted between 1955 and 1967 
th'-v are included in two publications (Flanders, 1965, 1970). For the first six 
^^^udi^-;- ( i^inr^, ^Trades 7, u, 8 and 6) the procedures followed were essentially the 
sarr.e : 

(1) An inventory assessing positive pupil attitudes were administered to a 

^inple of Classrooms. This sample was chosen so as to be representative of a 

lart^er population of similar classrooms in a given geographical area. 
■> 

(2) A7erat!;e scores on the inventory were '".alculated for each class. The classes 
located at the extremes of the resulting distribution of scores were selected 

for observation, except for projects 5 and 6 vjhich also included classrooms 
selected from the middle of the distribution. The purpose of selecting 
extreme classrooms was to^ increase the range of interaction patterns in the 
study. However, average attitude scores are far from perfecfy correlated 
with interaction patterns, so at best this procedure only increased the odds 
that here would be vider variation among classrooms with respect to inter- 
action patterns. 

(3) The classes so isolated were then observed and the classroom interaction 
coded by trained observers. Except for projects ] and 2 an assessment was 

made of content achievement before and after the observations* Final achieve- 
ment scores could then be adjusted according to intial achievement by a 
regression techniquvi (Flanders, 1970; p. 390-391). 

The seventh study which used second grade teachers did not include the 
initial pupil attitude inventory but raerely selected teachers at random. This 
study did not produce the same results as the other studies and Flanders blames 
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the different sample for these deviant findings. 

In all studios the teachers had to volunteer to have their students sampled 
before they were included. The major hypotheses in most studies seemed to be that 
the ir.purtant differences between teachers who had students with positive attitudes 
fiom teachers who had students with negative attitudes was in the amount o^ indirect 
teacher tehavior. CtuJents of indirect teachers should achieve more and have more 
positive attitudes. Indirect teachers were defined by the two ratios of I/D (where 
I .tijiic: for all indirect behaviors and D for all direct behaviors) and i/d (where i 
Lc the sare as I but d only includes category 7). The hypotheses was tested in some 
.studies by comparing mean amount of indirect behavior of teachers of students who 
hal the noot positive attitudes with mean amount of indirect behavior of teachers 
of students who had the least positive attitudes or ranking teachers on the 
aTount of indirect behavior exhibited and comparing student achievement or student 
attitudes of students of teachers exhibiting the most indirect behavior with 
stu'ients of teachers exhibiting vhe least indirect behavior. This latter pro- 
cedure has some problems in that the students of indirect teachers have generally 
beer, found to be smarter (higher initial achievement scores) than students of 
direct teachers. This has led to the necessity of a statistical adjustment for 
achievement scores, '^en this adjustment is made, Flanders found that students 
of indirect teachers had better attitudes and higher achievement scores than 
atuients of direct teachers. 

For the final four studies, Flanders used a number of correlational ^tech- 
niques (factor analysis, multiple 'regression) to obtain the best predictors 
of Student achievement arW^ student attitude for each study. In these cases the 
results were less consistent and Flanders concluded, '*If future studies have 
similar results, then the following generalization would be supported: a single 
interaction analysis predictor is not likely to be -associated equally well with 
different outcome variables, different grade levels and different learning activities" 



(p. 3^7). 
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In 5pit^3 :>t obvious Ghortconini^s of field research, the advocates of the 
TMr rav^> 'C }lloctei an iirpressive amount of data. Some of its most ardent advo- 
cate. > even s^s; the scale ic an important advance in understanding the process of ' 

\ 

teachin,-. (Morrli>;^n, r>^^-7; Pankrat::,, 1^67 ; Powell, 1967; Weber, 1965; Campbell and 
Parr.eG, 1959). Ae^ain', most of the studies report more positive results from 
tea^iher^, c iter.orizf^d a^s indirect. 

Tne of the more impressive attempts at making use of the interaction analysis 
[\ ^hat :f oar (l-*68) who after anlyzing interaction patterns of teacher behavior 
an- iifr-:ent treasures of student growth obtained from 54 elementary school class- 
rooms ^urrcL-t^- 1 th._it there may be different curvilinear relationships between amount 
of tvac!*er iniirectnes3 (or directness using the criticism measure) exhibited by 
the teacha?3 and different student attitude or achivement outcomes. 

The maior problen v;ith this whole line of research as the present author sees 
it is finally stated by Flanders (1967) on the next to last page of his book, where 
Flanders cays, "Does indi^^ect teaching behavior cause more learning and more 
^'O-^itive attituieo or do bri^^hter youngsters who can learn more and are more likely 
t'j have positive attitu<fes provide a teacher with the opportunity to be more 
indirect? .he projects reviewed shed very little light on this issue" (p. U26). 
?ur"narv an.! Conclusion 

From the above review it should be clear that good experimental research in 
thi: area has Leen relatively sparse and indeed several of the other reviews begin 
with a plea for more and better research in the area* In general, there are a number 
of r:laring problems with the research done so far. In the first place, nearly all 
^.tuli^i:> to date have been correlational in nature even though it is not uncommon 
for the authors to infer causation. In fact, to the present author's knowledge 
there has been only one study which actually asked teachers to exhibit a specific 
Ire'iavior pattern (Worthen, 1968) and then monitored the teachers* behavior. This 
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was not d sttidy- of teacher student interaction but rather a comparison of the 
effectiveness of two teaching t^hniques (discovery and expository). Interestingly 
enout^h the same author (Worthen and Collins, 1971) later had to repudiate his 
earlier results on the basin of incorrect analysis (using the wrong degrees of 
freedom). Clearly, before we can make inferences about the relative effectiveness 
of different teacher-student interaction patterns we must have some experimental 
studies. 

A ^econd weakness of many studies has been the fact that they used scales 
which are designed to be used with all grade levels for no allowance for differences 
anong erade levels. As reported earlier, findings have not been consistent across 
^rade levels leading to the suspicion that different teacher behaviors may be 
exhibited at different grade levels. 

VJith the exception of the early classic studies of Anderson and- Brewer (1963, 
ir-tif) few observational studies of any kind have been done with preschool children. 
i<at2 (ll^tj) in a recent survey found only one preschool study which used observation 
of in-class behavior. 

A third problem is that many of the studies and observational systems do not 
include any reliability data at all (for an excellent discussion of reliability 
problems encountered in using observational instruments see Mitzol and Medley, 
1965). Obviously, unless reliabilities across time and rater are established any 
experimenter using the scale is treading on very shallow ice. 

Another problem is that most rating scales are designed to be used for all 
subject matters. While is it clearly deeirable to have one all purpose instrument, 
it is still possible that different subject matter teachers exhibit different be- 
haviors and that this possibility should at least be investigated. 

Probably one of the most important shortcomings of the research in this area 
Iz thct few of the scales take into account which students are interacting with 

10 



-10- 



the teacher. For Instance, the FSIA has no provision for identifying the frequency 



with which a teacher interacts with a specific student. Good and Brophy (Good 



1^71; Brophv and Good, 1970) have been particularly vocal in this point. 



They point cut that this procedure implies that interaction variables being studied 



are properly conceptualised as interactions between the teacher and the class as 



a 'iuit rather than as an interaction between the teacher and a single student 
and that teachers are consistent across students in their classroom behavior so 
t^.^it in!Ivllual differences within a classroom are of little or nd importance rela- 
tive to ;ntercl33S differencoo. These implications are clearly questionable at best 
mi "ooi ^ni Erophy quote a nu.r.ler of studies including their own work to show 
that teachers io indeed interact with different students in different ways (Brophy. 
and :*ood, 1970; Lavis and Dollard, 1940; Becker, 1952; Hoehn, 1954; DeGroat and 
Thor^.rcon, 1?U9; Meyer and Thompson, 1956; Davis and Slobodion, 1967). 

A closely related problem is that rost studies use in their analysis of 
i^il. ^ain a total class mean rather than looking at gains of specific chil'dren 
(e.r. children who receive teacher attention). Any study which uses a class mean- 
can obviously ho masking important effects which are occurring within the class- 
roor, to particular children who exhibit a particular characteristic or belong to 
a particular ijroup. The problem of course is determining the important student 
characteristics. 

Two final problems are that studies generally use a relatively low number of 
teachers and an inadequate number of observations on each teacher. Nearly all 
-experimenters assume that teachers exhibit relatively stable characteristics and 
the possibility that it is the students which dictate the behavior of the teacher 
seems to be ignored. In fact, no study to the present author's knowledge has 
observed the same teacher teaching more than. one class of children in which each 
class has a specifiable difference in student makeup (e.g. intelligence, social clas^^etCf). 
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In surnr.jry, a tr>*ni»:*ndous amount of work mus*!; be done in the area of research 
iesi-^n before we can begin to^ have confidence in research findings which attempt 
to measure student-teacher interaction patterns. Until w^^ begin to utilize the 
experimental 'desi<;^n and retreat from the correlational design this entire area 
of research will contribute very little to our knowledge of the teaching process. 
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